Approximate discrete-time schemes for the estimation of diffusion 
processes from complete observations 



J.C. Jimenez 

Instituto de Cibernetica, Matematica y Fi'sica 
Departamento de Matematica Interdisciplinaria 
Calle 15, no. 551, Vedado, La Habana, Cuba 
e-mail: jcarlos@icimaf.cu 



December 11, 2012 



Abstract 

In this paper, a modification of the conventional approximations to the quasi-maximum likelihood 
method is introduced for the parameter estimation of diffusion processes from discrete observations. 
This is based on a convergent approximation to the first two conditional moments of the diffusion 
process through discrete-time schemes. It is shown that, for finite samples, the resulting approximate 
estimators converge to the quasi-maximum likelihood one when the error between the discrete-time 
approximation and the diffusion process decreases. For an increasing number of observations, they 
are asymptotically normal distributed and their bias decreases when the above mentioned error does 
it. A simulation study is provided to illustrate the performance of the new estimators. 

1 Introduction 

The statistical inference for diffusion processes described by Stochastic Differential Equations (SDEs) is 
currently a subject of intensive researches. A basic difficulty of this statistical problem is that, except for 
a few simple examples, the joint distribution of the discrete-time observations of the process has unknown 
closed-form. To overcome this, a number of estimators based on analytical and simulated approximations 
have been developed during more than three decades. Such methods are the focus of a growing literature. 
See, for instance, the review papers by Bibby and Sorensen (1996), Prakasa-Rao (1999), Nielsen et al. 
(2000) and Jimenez et al. (2006). 

In particular, the present paper deals with the class of quasi-maximum likelihood (QML) estimators for 
the parameter estimation of SDEs given a time series of complete observations. These are the estimators 
obtained by maximizing a normal log- likelihood function when the assumption of normality is not satisfied 
and all the components of the diffusion process are discretely observed. The simplest approximations to 
this class of estimators are derived, for SDEs with additive noise, from the likelihood of the discrete-time 
process defined by a numerical integrator. Typically, they are derived from the Euler-Maruyama scheme 
(Prakasa Rao 1983, Yoshida 1992, Florens-Zmirou 1989) or from the Local Linearization (LL) one (Ozaki 
1985, 1992; Shoji & Ozaki 1997, 1998). It have been demonstrated that, when the distance between 
two consecutive observations remains fixed, these pseudo-likelihood estimators are asymptotically biased 
when the number of observations increases (Florens-Zmirou 1989). However, a number of comparative 
studies among different estimation methods have shown that, in practical situations in which the distance 
between observations is small enough, the pseudo-likelihood estimators based on LL integrators display 
the better performance due to their simplicity, computational efficiency and negligible bias (see, e.g., Shoji 
& Ozaki 1997, Durham & Gallant 2002, Singer 2002, Hum et al. 2007). Therefore, any modification 
to these approximate QML methods that yields a bias reduction will be useful. In this direction, two 
methods have early been proposed. For the estimators based on the Euler-Maruyama integrator, Clement 
(1995) introduced a correction to the bias by means of simulations. Whereas, on the basis of Taylor 
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expansions of the first two conditional moments of the discrete-time process, Kessler (1997) archives 
similar results. Depending of the specific SDE to be estimated, the first method could be computationally 
time demanding, while the second one could be affected by numerical instabilities resulting from a high 
order Taylor expansion. More recently, Huang (2011) proposed new estimators based on high-order 
numerical integrators, which improve the accuracy of the approximation for the first two conditional 
moments and can be straightforward applied to SDEs with multiplicative noise. 

A common feature of the approximate QML methods mentioned above is that, once the observations 
are given, the error between the approximate and the exact moments of the diffusion is fixed and com- 
pletely determined by the distance between observations. Clearly, this increases the bias of the QML 
estimation for finite samples and obstructs its asymptotic correction when the number of observations 
increases. 

In this paper, an alternative modification of the conventional approximations to the QML estimator 
for diffusion processes is introduced, which is oriented to reduce and control the estimation bias. This is 
based on a recursive computation of the first two conditional moments of discrete-time approximations 
converging to the diffusion process between two consecutive observations. It is shown that, for an increas- 
ing number of observations, the resulting estimators are asymptotically normal distributed and their bias 
decreases when the error of the discrete-time approximation does it. 

The paper is organized as follows. In section 2, basic notations and definitions are presented. In section 

3, the new approximate estimators are defined and some of their properties studied. As a particular 
instance, the order-/? QML estimator based on the Local Linearization schemes is presented in Section 

4, as well as algorithms for its practical implementation. In the last section, the performance of the new 
estimators is illustrated with various examples. 

2 Notation and preliminary 

Let (fi,.F, P) be the underlying complete probability space and {J^t, t > to} be an increasing right 
continuous family of complete sub <T-algebras of J 7 . Consider a <i-dimensional diffusion process x defined 
by the following stochastic differential equation 

m 

dx(t) = {(t,Tc(t);9)dt + '^2gi(t,x(t);8)d\v l (t) (1) 

i=l 

for t > to £ M, where f and gi are differentiable functions, w = (w , ..,w' m ) is an m-dimensional Tt- 
adapted standard Wiener process, £ T>g is a vector of parameters, and Vg C MP is a compact set. 
Linear growth, uniform Lipschitz and smoothness conditions on the functions f and gi that ensure the 
existence and uniqueness of a strong solution of JT]) with bounded moments are assumed for all £Vg. 

Denote by z the diffusion process defined by ([1]) with 9 = 9q £ Vg, and suppose that M observations 
of the process z on an increasing sequence of time instants {£}m = {tk ■ tk < tk+i, k = 0, 1, .., M — 1} 
are given. More precisely, denote by z^ the observation of the process z at tk for all tk £ {£}m and by 
Z = {zo, .., Zjw-i} the sequence of these observations. 

The inference problem to be consider here is the estimation of the parameter 6q of the SDE ([1]) given 
the time series Z. In particular, let us consider the quasi-maximum likelihood estimator defined by 

d M = a,rg{xmnUu(e,Z)} (2) 
o 

where 

M-l 

U M (6, Z) = (M - 1) lnf» + ln(det(£ fe )) + (z fe - /x fc ) T (S fc )" 1 (z fc - Mfc ), 

k=l 

and fi k = _E(x(tfc)|zfc_i) and Sfc = £'(x(ife)x T (tfc)|zfe_i) — denote the conditional mean and variance 
of the diffusion process x at tk given Zk-i, for all tk-i,tk £ {t}u and 8 £ T>g. Because the first two 
conditional moments of x are correctly specified, the score of the normal log-likelihood satisfies the 
martingale difference property, and so the QML estimator @ is consistent and has an asymptotically 
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normal distribution. See Bollerslev & Wooldridge (1992) and Wooldridge (1994) for ergodic and no 
ergodic processes, respectively. 

In general, since the conditional mean and variance of equation ([1]) have not explicit formulas, ap- 
proximations to them are needed. If ]l k and Sfc are approximations to fi k and then the estimator 

1? M = arg{mint7 M (0, Z)}, 
o 

with 

M-l 

U M (d, Z) = (M - 1) ln(27r) + ^ In(det(S fc )) + (z fc - /i fc )T(£ fc )-i( Zfc _ £ fc ) 

fc=i 

provides an approximation to the quasi- maximum likelihood estimator 6m- 

Approximate estimators of this type have early been considered in a number of papers (Prakasa Rao 
1983; Florens-Zmirou 1989; Yoshida 1992; Ozaki 1985, 1992; Shoji & Ozaki 1997, 1998^ and recently 
in Huang (2011). In all of them, the approximate mean £t fe = _E(yfc|zfc_i) and variance Sfc = E({y k — 
A*fc)(yfe — A*fc) T l z fc-i) are derived from a discrete-time scheme = z k -i + <fi(tf.-i,Zk-i,tk — tfc-i) that 
approximate to x(ifc) in just one step of size tk — tk—i from the observation z^—i. Indistinctly, these 
estimators are called pseudo-likelihood estimators, or minimum contrast estimators or prediction error 
estimators depending of the inferential considerations that want to be emphasized. It has been proved 
(Florens-Zmirou, 1989) that, for the time partition {£}m = {tk = kS : k = 0, 1, • • • , M — 1} with S > 
fixed, these estimators are biased as MS — > oo. Contrary, they are asymptotically unbiased on the time 
partition {£}m = {tk = k&M '■ k = 0, 1, • • • , M — 1} in the case that MSm — > oo, but with M d 3 M — > (or 
more accurately with MS M — > as in Yoshida, 1992). However, last restriction on M and Sm imposes 
too strong relation between the number of observations and the distance between them, which is very 
inconvenient from a practical viewpoint. Further note that, once the data Z are given (and so the time 
partition {£}m is specified), the error between y k and x(i^) is completely settled by t k — t k -\ and can 
not be reduced. In this way, the difference between the approximate quasi-maximum likelihood estimator 
#m and the exact one 6m can not be reduced neither. 

Denote by Cp(M d ,]R) the space of I time continuously diffcrcntiable functions g : M. d — > R for which g 
and all its partial derivatives up to order / have polynomial growth. 



3 Order-/? quasi-maximum likelihood estimator 



Let (t)/ 1> o = { T n '■ T n+i ~ T n < h, n = 0, 1, . . . , N} be a time discretization of [to, £m-i] such that (r) h D 
{£}m) an d y n be the approximate value of x(t„) obtained from a discretization of the equation (TTJ) for 
all T n G (t)i. Let us consider the continuous time approximation y = {y(t), t 6 [io,ijwf— l] : y( T »0 = Yn 
for all r n £ { T )h\ °f x w ith initial conditions 



E y (to) \Ft )=E[ x(t ) Tt Q and El y (i )y T (to) ^ 



£ x(t )x T (i„) J 1( 



satisfying the bound condition 



E{\y(t)\ 2q \T t0 )<L 



for all t € [to,tM-i]', and the weak convergence criteria 



sup 

t k <t<t k 



E ff(x(t)) \Tt k ) -E \ g{y(t)) T t 



(3) 



(4) 



for all t k ,t k +i G {£}m and £T>g, where g g Cp R), L and are positive constants, /3 G N + , 

and q = 1,2.... The process y defined in this way is typically called order-/? approximation to x in 
weak sense (Kloeden & Platen, 1999). In addition, the second conditional moment of y is assumed to be 
positive definite and continuous for all 6 G T>g. 

When an order-/3 approximation to the solution of equation (TTJ is chosen, the following approximate 
quasi-maximum likelihood estimator can be naturally defined. 
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Definition 1 Given a time series Z of M observations of the SDE {Ip with 6 — 6$ on {t}M, the order-(3 
quasi-maximum likelihood estimator for the parameters of fTD is defined by 



6 M {h) = sxg{mmU M ,h{e, Z)}, (5) 
o 

where 

M-l 

U M ,h{e,Z) = (M-l)ln(27r)+ J2 ln(det(S, i , fc )) + (z fc -/x, a ,) T (i: /l , fc )- 1 (z fc -^ fc ), 

k=l 

Vh.k = E(y(tk)\zk-l), ^M = E(y(t k )y T (t k )\z k _ 1 ) - ^ h ^ h k , y is an order-f3 approximation to the 
solution of (QP in weak sense such that £(y(tfc)|zfe) = z k and -E(y(ifc)y 1 {tk)\ z k) = z fczJ for all t k € {t}M, 
and h is the maximum stepsize of the time discretization {r) h D {£}m associated to y. 

In principle, according to the above definition, any kind of approximation y converging to x in a weak 
sense can be used to construct an approximate order-/3 quasi-maximum likelihood estimator, e.g., those 
considered in Kloeden & Platen (1999). In this way, the Euler-Maruyama, the Local Linearization and any 
high order numerical scheme for SDEs might be used as well, but the approximations fih.k and Hh,k will be 
now derived from the conditional moments of the numerical scheme after various iterations with stepsizes 
lower than t k — tfc— i- Note that, when (tL = {£}m ; the so defined order-/? quasi-maximum likelihood 
estimator reduces to the corresponding approximate quasi-maximum likelihood estimator mentioned in 
Section [5J That is, to one of those considered in Prakasa Rao (1983), Yoshida (1992), Florens-Zmirou 
(1989), Ozaki (1985,1992), Shoji k Ozaki (1997,1998), or Huang (2011). 

3.1 Convergence 

For a finite sample Z of M observation of (Q]) , the following convergence result is useful. 

Theorem 2 Let Z be a time series of M observations of the SDE with 8 — 9 on the time partition 
{£}m- Let 6m and Om{K) be, respectively, the quasi-maximum likelihood and an order-/3 quasi-maximum 
likelihood estimator for the parameters of (QP given Z . Then 



9 M (h)-e M 
as h — > 0, for all fixed Z . 

Proof. Defining AHh,k = — Hh,k, it follows that 







det(E M ) = dct(S fc - AE M ) 

= det(E fc ) det(I - X AE M ) (6) 



and 



= S^ + S^AE^P-E^AEa,*)- 1 ^ 1 . (7) 
By using these two identities and the identity 

( z * -Mh,fc) T ( s ft,fe) _1 ( z fe -Mm) = ( z fc-^) T (EM) _1 (^-ft) 

+( z fc - Mk) T ( s M) -1 (Mk - Mm) 

+(Mk " Mm) T ( S m) -1 ( z * ~ Mfc) 

+(Mfc - MM) T ( s M) -1 (Mk - Mm) ( 8 ) 

it is obtained that 

U M , h (0, Z) = U M (0, Z) + R u , h (0), (9) 
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where Um and Um,k are denned in ([2]) and ([5]), respectively, and 



M-l 



RM,h{0) = ln ( dct (I - Sfc^Efc.k)) + (z fe - Mfe )T M/i fe ( Zfc _ Mfc ) 

^,fe) T ( s /.,fe) _1 ( z fe - Mfc) 



fc=i 



+( z fc -Mfe) T ( s ^,fc) l {^ k - H Kk ) + {n k 



with M M = Ej^ 1 AE h)fc (I - H^AE^k)- 1 !! 
Condition (QJ directly implies that 



-l 
k ■ 



|£7(x(tfc)|z fc _i) - B(y(t fc )|z fc _i)| < £fe-i^ 



and 



|£(x(t fc )xT(i fc )|z fc _i) - £(y(t fc )y T (t fc )|z fe -i)| < L k ^h fj . 
From this and the finite bound for the conditional mean of x and y, it is obtained that 

\&k -Mii,fcl ^ and |S fc -S ft ,fe|^0 

as /i — >■ for all 9 E T>g and A; = 1, .., M — 1. This and the hnite bound for the first two conditional 
moments of x and y imply that Ru,h{6) — > as well with h. From this and ([9]). 



e M {h)-e 



M 



arg{min{[/ M (0, Z) + Rmm(0)}} - arg{minC/ Af (6>, Z)} 





o 



as h — > 0, which concludes the proof. ■ 

The above theorem states that, for each given data Z, the order- j3 QML estimator 0M(h) converges 
to the exact one Om as h goes to zero. Because h controls the weak convergence criteria Q is then clear 
that the order-/? QML estimator ((SJ) converges to the exact one (J2J) when the error (in weak sense) of 
the order-/? approximation y to x decreases. This is a remarkable positive feature that the conventional 
approximate QML estimators mentioned in Section [2] do not have. 



3.2 Asymptotic properties 

In this section, asymptotic properties of the approximate quasi- maximum likelihood estimator OmQi) will 
be studied by using a general result obtained in Ljung and Caines (1979) for prediction error estimators. 
According to that, the relation between the estimator 8m(}i) and the global minimum 9* M of the function 

W M (0) = E{U M {0, Z)) with 6 eV e (10) 

should be considered, where Um is defined in ([2]). Here, it is worth to remark that 6* M is not an estimator 
of since the function Wm does not depend of a given data Z. In fact, 0* M indexes the best predictor, 
in the sense that the average prediction error loss function Wm is minimized at this parameter (Ljung & 
Caines, 1979). 

In what follows, regularity conditions for the unique identifiability of the SDE ([T]) are assumed, which 
are typically satisfied by stationary and ergodic diffusion processes (see, e.g., Bollerslev & Wooldridge 
(1992) or Ljung & Caines (1979)). 

Lemma 3 If Sfc is positive definite for all k = 1, .., M — 1, then the function Wm(0) defined in U0\) has 
an unique minimum and 

arg{minWAf(0)}=0 o . (11) 

Proof. Since Sfc is positive definite for all k — 1, .., M— 1, Lemma A. 2 in Bollerslev & Wooldridge (1992) 
ensures that 9q is the unique minimum of the function 

l k (8) = £(ln(det(£ fc )) + (*k - M fc ) T (S fe ) _1 (zfe - /**)|z fc -i) 
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on T>g for all k. Consequently and under the assumed unique identifiability of the SDE (JTJ) , 6q is then 
the unique minimum of 

M-l 

W M (0) = (M - l)ln(27r) + E(h(0)) 

k=l 

on Vg. m 

Here, it is worth to remark that the result of this Lemma is restricted to the QML estimator 
for SDEs. However, for other types of stochastic processes, a similar result can be found in the proof 
of Theorem 2.1 of Bollerslev & Wooldridge (1992) concerning the asymptotic properties of the QML 
estimator under more general framework. 

Denote by U' M h the derivative of Um,h with respect to 8, and by W M the second derivative of Wm 
with respect to 8. 

Theorem 4 Let Z be a time series of M observations of the SDE flj) with 8 = 8 on the time partition 
{£}m- Let Om{Ji) be an order-/3 quasi-maximum likelihood estimator for the parameters of (Qp given Z. 
Then 

M (h)-0 o ^ A8 M (h) (12) 

w.p.l as M — > oo, where A0j\,/(/i) — ► as h — > 0. Moreover, if for some M G N there exists e > such 
that 

w' M {8)>el and Hmji(8) = ME{U' Mh {8,Z){U' Mh {6,Z)) 1 ) > el (13) 
for all M > Mq and 8 £ Vg, then 

VMPlj/ h 2 (8M(h) - Oo) ~ N{A8 M (h), I) (14) 

asM^oo, wherePMM = {W M (0 O + A0 M (/i)))" 1 H M ,/ t (0o + A0 M {h)){W' M (8 o + Aduih)))- 1 + AP M ,h 
with AP M ,h — > as h — > 0. 

Proof. Let Wm,h{0) = E(UM,h(0, Z)) and cxM(h) = arg{ min Wm m(6)}, where Umm is defined in ([5]). 
For a ft, fixed, Theorem 1 in Ljung & Caines (1979) implies that 

M (h) - a M {h) -»• (15) 

w.p.l as A/ — > oo; and 

y/MP]£J?{a M {h)){d M {h) - a M {h)) ~7V(0,I) (16) 

as M — > oo, where 

Pma(0) = TO.hCe))- 1 H M , fc (0) wuwr 1 

with H M , h (0) = ME(U' M>h (9, Z){U' M>h {0, Z)Y). 
By using the identities dU)-©, the function 

M-l 



= (Af - l)ln(27r) + J2 ^(In(det(S fc>fc )) + (** - MmWEm)- 1 ^* - 
fc=i 

Wjf,fc(») - + E(R Mth (0)), (17) 



fc=i 

can be written as 



where Wm is defined in (fl"U|) and 



M-l 



i?M,hW - £ S(ln(det(I-S- 1 AS, l , fc ))|J- tfc _ 1 )+ J B((z fe - / x fe )TM, l , fc (z fe - Atfc )|J- tfc _ 1 ) 
fc=i 

+£((z fc - Mfe )T( SM )-iQ^ - ,i M )|JvJ + ^((M* - /i h)fc ) T (S fclfc )- 1 (zfc ~ Mfc)^) 
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with M M = E^AE^I - Ejr^AE,,,*) -1 ^* 1 and AE M = E fe - S M . 

Denote by h and h the second derivative of Wm.k and Rm,k with respect to 0. 
Taking into account that 

= (W&iO))- 1 +K Mth (0) 

with 

it is obtained that 

Pm,/»(0) - (^M(0))- 1 H Afift (0)(l^M(0))- 1 + £P M , h {6), (18) 

where 

APmj^) = K Mth (0)H Mih (8)(W , M{0))- 1 + (W r M(0))- 1 H A/ , ft (0)K M ,h(0) + K AI . h (e)H M . h (e)K M , h ^)- 
Condition directly implies that 

|£(x(t fe )|zfe-i) - £(y(t fc )|**-x)| < Lk^h 13 

and 

|S(x(t fc )xT(t fc )|z fc _i) - B(y(tfc)y T (tfc)|z fc -i)| < ife-ift' 3 . 
From this and the finite bound for the conditional mean of x and y, it is obtained that 

| A*fc — | — ^ and | E fc - E ft)fc | -J- 
as /i — >■ for all 9 E T)g and fc = 1, .., M — 1. This and the finite bound for the first two conditional 



moments of x and y imply that \RM,h{0, Z)\ — > and R" M h (9, Z) 
(fT?)) , it is obtained that 



as well with h. From this and 



W M ,h{0) -> W M {8) and W M , h {8) -> W M {8) as -> 0. (19) 
In addition, left (|T9]) and Lemma [3] imply that 

A6» M (W = « M (h) - O = arg{ min Wm h (#)} - arg{ min W M (8)} -> as -> 0, (20) 

whereas from right (fl9f follows that 

AP M ,ftW^0 as 7i->0. (21) 



Finally, (j20jl -([2T j) together {H}, y2J) and {THjl imply that {EJ and (H]) hold, which completes the 
proof. ■ 

Theorem [4] states that, for an increasing number of observations, the order- /3 QML estimator <?a/(/i) 
is asymptotically normal distributed and its bias decreases when h goes to zeros. This is a predictable 
result due to the known asymptotic properties of the exact QML estimator 9m stated in Bollerslev and 
Wooldridge (1992) and the convergence of the approximate estimator 9u{h) to 9m given by Theorem^ 
when h — > 0. Further note that, when h = 0, the Theorem 2] reduces to Theorem 1 in Ljung & Caines 
(1979) for the exact QML estimator 9m- This is other expected result since the order- (3 QML estimator 
#m(/j) reduces to the exact one 9m when h = 0. 
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4 Order-/^ QML estimator based on LL schemes 



Since, in principle, any type of approximation converging to the solution of (TTJ) in a weak sense can 
be used to construct an order-/3 QML estimator, some additional criterions could be considered for the 
selection of one of them. For instance, high order of convergence, efficient algorithm for the computation 
of the moments, and so on. In this paper, we elected the order-/? Local Linear approximations (see, 
e.g., Jimenez & Biscay, 2002, and Jimenez & Ozaki, 2003) for the following reasons: 1) their first two 
conditional moments have simple explicit formulas that can be computed by means of efficient algorithm 
(including high dimensional equations) as in Jimenez & Ozaki (2002,2003) and Jimenez (2012a); 2) their 
first two conditional moments are exact for linear equations in all the possible variants (with additive 
and/or multiplicative noise, autonomous or not), see Jimenez & Ozaki (2002); 3) they have an adequate 
order j3 = 1,2 of weak convergence (Carbonell et al., 2006 and Jimenez, 2012b); and 4) the better 
performance of the conventional QML estimators based on Local Linearization schemes due to their 
simplicity, computational efficiency and negligible bias (see, e.g., Shoji & Ozaki 1997, Durham & Gallant 
2002, Singer 2002, Hum et al. 2007). 

It is known that the first two conditional moments of the Local Linear approximations satisfy a set 
ordinary differential equations. Explicit formulas for the solution of these equations can be found in 
various papers as it was mentioned before. In what follows, the simplified expressions derived in Jimenez 
(2012a) are presented. 

Denote by y T „/t k — £"(y( T ri)l z *;) an d P T „/t t = E(y(T n )y J (r n )|zfc) the first two conditional moment 
of the order-/3 Local Linear approximation y at r„ given the observation for all r n 6 {(t)^ n [tjt, tfc+i]} 



and k — 0, .., M — 2. Clearly, y t 



and V, 



to the exact conditional mean and variance Sfc+i, respectively, for all tfc , ifc+i G {£}a/- Moreover, 

Yt k /t k = Zfc and P tk /t h = z kZ T k for all t k G {t} M - Let n t = max{n = 0, 1, . . . : r n < t and r n G (r) h } for 
all t G [ta,t M -i\- 

According to Jimenez (2012b), the approximate moments yt k+1 
evaluating the recursive formulas 



provide approximations 



and P f 



i/h 



are obtained by 



nt — 1 



(22) 



and 



nt — 1 



iP t/tk )= J2 L ie M ^)^+— )u T „, tfe +L 1 e M ^)( t -^)u T „ t , 



n=n t . 



at t = tjt+i, where the vector u T tfc and the matrices M(t), Li, L 2 are defined as 



M(r) = 



A(t 


) B 5 (r) 


Mr) 




Ba(r) 


Bx{r) ' 




«ec(P T/tfc 





C(r) 


1(1+2 






















C(r) 













r 














2 





j u r,t fc — 




















1 



























1 




Li = [ I d 


2 0^2. 


<(2d+7) 




L 2 =[ 


0dx(d 2 +d+2) 


Id 0d x5 



and 



in terms of the matrices and vectors 



A(t) = A(t)9A(t) + £ B 2 (r)®Bj(r), 



(23) 



o(d 2 +2d+7) 



C(r) = 



A(r) &1 (t) A(r)y Titfc +a (r) 
1 




i+2)x(d+2) 
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r T = [ lx(d+1) 1 ] 

Bi(r) = vec^)) + /3 4 (r)y T/tfc , B 2 (r) = «ec(/3 2 (r)) + /3 5 (r)y r/tfc , B 3 (r) = wec(/3 a (r)), B 4 (r) = B 4 (r)L 
and B 5 (t) = B 5 (r)L with 



/9 2 (t 
/9 8 (t 

05 (T 



X> )0 (r)bT (r) 

i=l 
m 

^koMb^rj+b^Mbyr) 

i=l 
m 

i=l 

m 

ao(r) 8 a (r) + ^ b^ (r) B,(r) + B,(r) <g> b^ (r) 

i=i 

m 

ai(r) 8 ai(r) + ^b M (r) ® B,(r) + B,(r) ® b M (r), 



i=l 



L = [ Id 0^x2 1, and the <i-dimensional identity matrix 1^. Here, 



A(r) 



9f(r,y T/t J 



and Bj(r) 



9Si(r,y T /t k ) 



dy ' <9y 

are matrices, and the vectors ao(r„ t ), ai(r nt ), b^o(7"n t ) and bi j i(r„ t ) satisfy the expressions 

^(t^nt) = a (r, it ) + a^TnJ^ - r„J and bf (t;r„J = b ii0 (T„ t ) + b»,i( r nt)(* _ T »J 
for all t £ [tfc , tfc+i] and r„ t € where 

for = 1 



a"(t;r) 



f( T v , 1 9f ( T .yx/tJ 3f(r,y r/tfc ) , , 



a^ + i E [G(r,y r/t JGT(r,y r/t J] 



3>/=l 



dy j dy l 



and 



bf (t;r) 



B(r,y(r)) - ^^V r/fr + »*g-"»> (f - r) 



(i - r) for = 2 



for /3 = 1 



H(t;r) + l E [G(r,y T/t JGT(r,y T/t J 



d*gi(T,y(T)) 



3,1=1 



dyi dy 



h^-it-r) for p = 2 



are the order-/3 Ito- Taylor expansions for the drift and diffusion coefficients of (JTJ) in the neighborhood 
of (t, y r / th ), respectively, and G = [g 1 , . . . , g m ] is an d x m matrix function. The symbols vec, ® and (g> 
denote the vectorization operator, the Kronecker sum and product, respectively. 

Under general conditions, Lemma 7 and Theorem 9 in Jimenez (2012b) state that the order-/? Local 
Linear approximation y satisfies the bound condition ([3]) and the weak convergence criteria ((3]). Hence, 
Theorem [2] implies that the order- p QML estimator 



6 M (h) = &rg{mmU M M d > z )}: 





(24) 



with 



M-2 



U Mlh (0,Z) = (M — 1) ln(27r) + £ ln(det(V tfc+l/t J) + (z fc+1 - y tfc+l/tfc ) T (V tfc+l/t J- X (z fe+1 - y tfc+l/t J, 



k=0 



converges to the exact one ^ as h goes to zero for all given Z. For the same reason, the order-/3 QML 
estimator (|24p has the asymptotic properties stated in Theorem 2) 
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For one-dimensional SDEs with additive noise, the order-/3 QML estimator (|24|) reduces to the con- 
ventional estimators of Ozaki (1985, 1992) and Shoji & Ozaki (1997, 1998) when (r) h = {t} M - It is 
worth to emphasize that, for each data z^, the formulas (|2"2")) - (|2"3")) are recursively evaluated at all the time 
instants r„ G {(t) a fl (ifc,ifc+i]} for the first estimator, whereas they are evaluated only at tk+i = {(t)^ 
n (ife, ife+i]} for the conventional ones. 

From computational viewpoint, each evaluation of the formulas (f22j) - (|2"3|) at r„ requires the compu- 
tation of just one exponential matrix whose matrix depends of the drift and diffusion coefficients of (|TJ) 
at (r„_i, y Tn _i /*>,)• This exponential matrix can the efficiently computed through the well known Pade 
method (Moler & Van Loan, 2003) or, alternatively, by means of the Krylov subspace method (Moler & 
Van Loan, 2003) in the case of high dimensional SDEs. Even more, low order Pade and Krylov methods 
as suggested in Jimenez & de la Cruz (2012) can be used as well for reducing the computation cost, but 
preserving the order- /3 of the approximate moments. Alternatively, simplified formulas for the moments 
can be used when the equation to be estimate is autonomous or has additive noise (see Jimenez, 2012a). 
All this makes simple and efficient the evaluation of the approximate moments Yt k+1 /t k 

and Vt h+1 /t k - 

In practical situations, it is convenient to write a code that automatically determines the time dis- 
cretization (t)u for achieving a prescribed absolute (atol y , atolp) and relative (rtol y , rtolp) error tolerance 
in the computation of yt k+1 /t k and Pt k+1 /t k ■ With this purpose the adaptive strategy proposed in Jimenez 
(2012b) is useful. 



5 Simulation study 

In this section, the performance of the new approximate estimators is illustrated, by means of simulations, 
with four test SDEs. To do so, four types of QML estimators are computed and compared: 1) the exact 
one ©, when it is possible; 2) the conventional one based on the LL scheme. That is, the estimator 
defined by (|2"4")l with (r) h = {£}m and j3 = 1; 3) the order-1 QML estimator (IM1) with various uniform 
time discretizations (r)^ T ; and 4) the adaptive order-1 QML estimator (j2"4"]l with the adaptive selection 
of time discretizations (r). T proposed in Jimenez (2012b). For each example, histograms and confidence 
limits for the estimators are computed from various sets of discrete observations taken with different time 
distances (sampling periods) on time intervals with distinct lengths. 

5.1 Test equations 

Example 1. Equation with multiplicative noise 

dx = atxdt + aVixdwi (25) 

where a — —0.1 and a = 0.1 are parameters to be estimated, and x(to) — 1 is the initial value of x at 
to = 0.5. For this equation, the conditional mean and variance of x at tk+i given the observation Zk of x 
at tk are 

Mfe+1 = % e"<*2 + *-*2>/ 2 and E A+1 = ^ e (°+- 2 /2)(4 +1 -^) _ ^ 

respectively, for all tk+i > tk > to- 

Example 2. Equation with two additive noise 



dx = atxdt + crt 2 e at2/2 dwi + pVtdw 2 (26) 

where a = —0.25, a = 5, and p = 0.1 are parameters to be estimated, and x(to) = 10 is the initial value 
of x at to — 0.01. For this equation, the conditional mean and variance of x at tk+i given the observation 
Zk of x at tk are 

and 

2 2 2 

+1 = 2a ~5~"*;+i _ t k) e 2a' 

respectively, for all tk+i > tk> t . 
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Example 3. Van der Pool oscillator with random input (Gitterman, 2005) 



dxi = Xidt 

dx,2 = — 1)%2 — xi + a)dt + adw 



(27) 
(28) 



where a = 0.5 and a 2 = (0.75) 2 are, respectively, the intensity and the variance of the random input that 
should be estimated. In addition, to = 0, and x T (io) = [11]. 

Example 4. Van der Pool oscillator with random frequency (Gitterman, 2005) 



where w = 1 and a 2 = 1 are, respectively, the frequency mean value and variance that should be 
estimated. In addition, to = 0, and x T (to) = [11]. 

In these examples, autonomous or non autonomous, linear or nonlinear, one or two dimensional equa- 
tions with additive or multiplicative noise are considered for the estimation of two or three parameters. 
Note that, since the first two conditional moments of the Examples 1 and 2 have explicit expressions, the 
exact QML estimator @ can be computed. 

These four equations have previously been used in Jimenez (2012b) to illustrate the convergence of the 
approximate moments (|22[1 - (|23[) by means of simulations. Tables with the errors between the approximate 
moments and the exact ones as a function of h were given for the Examples 1 and 2. Tables with the 
estimated rate of convergence were provided for the fours examples. 

5.2 Simulations with one-dimensional equations 

For the first two examples, 100 realizations of the solution were computed by means of the Euler (Kloeden 
& Platen, 1999) or the Local Linearization scheme (Jimenez et al., 1999) for the equation with multi- 
plicative or additive noise, respectively. For each example, the realizations where computed over the 
thin time partition {to + 10~ 4 n : n = 0, .., 30 x 10 4 } to guarantee a precise simulation of the stochastic 
solutions on the time interval [to, to + 30]. Twelve subsamplcs of each realization at the time instants 
{t}M,T — {tk = to + kT/M : k — 0, ..,M — 1} were taken as observation Z of x for making infer- 
ence with various values of M and T. In particular, the values T = 10,20,30 and M — T/S with 

5 = 1, 0.1, 0.01, 0.001 were used. In this way, twelve sets of 100 time series Z\ T = {z l k : k = 0, .., M — 1, 
M = T/S}, with i = 1, .., 100, of M observations z\ each one were finally available for each example with 
the twelve values of (5, T) mentioned above. This will allow us to explore and compare the performance of 
each estimator from observations taken with different sampling periods 5 on time intervals with distinct 
lengths T. 

Figure 1 shows the histograms and the confidence limits for both, the exact (af T ) and the conven- 
tional {ols.t) QML estimators of a computed from the twelve sets of 100 time series Z\ T available for 
the example 1. Figure 2 shows the same but, for the exact (a s T ) and the conventional (cts.t) QML 
estimators of a. As it was expected, for the samples Z\ T with largest sampling periods, the parameter 
estimation is distorted by the well-known lowpass filter effect of signals sampling (see, e.g., Oppenheim 

6 Schafer, 2010). This is the reason of the under estimation of the variance erg T from the samples Z\ T , 
with 6=1 and T = 10, 20, 30, when the parameter a in the drift coefficient of (|2"5)l is better estimated 
by a s T . Contrarily, from these samples, the conventional QML estimators ag,T can not provided a good 
approximation to a, and so the whole unexplained component of the drift coefficient of (|25[) included 
in the samples is interpreted as noise by the conventional QML estimators. For this reason, ag,T over 
estimates the value of the parameter a. Further, note that when the sampling period 5 decreases and the 
length T of the observation time increases, the difference between the exact (af T , dfj>) and the conven- 
tional (ot6,T> &S,t) QML estimators decreases, as well as the bias of both estimators. This is also other 
expected result. Here, the bias is estimated by the difference between the parameter value and the estima- 
tor average, whereas the difference between estimators refers to the histogram shape and confidence limits. 



dx\ = X2dt 

dx2 = {—(xi — l)x2 — vjxi)dt + axidw 



(29) 
(30) 



11 



8 = 1 


h = 5 


h = 5/2 


h = 5/8 


h = 5/32 


T = 10 
a T = 20 
T = 30 


5.7 ±3.7 x 10~ a 


1.3 ±0.9 x 10~ a 


2.2 ± 1.6 x 10~ 4 


5.4 ±4.0 x 10~ b 


3.7 ± 1.6 x 10~ a 


6.6 ±3.8 x 10~ 4 


8.2 ±7.6 x 10~ 5 


2.3 ±2.1 x 10~ 5 


5.8 ± 1.6 x 10~ a 


8.1 ±3.4 x 10~ 4 


5.4 ±4.6 x 10~ b 


1.7 ± 1.4 x 10~ b 


T= 10 
o- T = 20 

r = 30 


2.8 ±2.0 x 10~ 2 


8.6 ±5.2 x 10~ a 


1.7 ±0.9 x 10~ a 


4.0 ±2.1 x 10~ 4 


1.4 ± 1.3 x 1Q- 2 


4.5 ±2.9 x 10~ a 


9.4 ± 5.1 x 10~ 4 


2.2 ± 1.1 x 10~ 4 


8.4 ±8.7 x 10~ a 


2.6 ±2.1 x 10~ a 


6.2 ±3.6 x 10~ 4 


1.5 ±8.4 x 10~ b 



Table I. Confidence limits for the error between the exact and the approximate QML estimators of the equa- 
tion ((25}. h = 5, for the conventional; and h = (5/2,5/8,5/32, for the order-1 on (t)? t . 



5 = 1 


a 


a 


h 


T = 10 T = 20 T = 30 


T = 10 T = 20 T = 30 


5 


-0.0030 -0.0035 -0.0058 


-0.0286 -0.0150 -0.0071 


5/2 


-0.0006 -0.0005 -0.0008 


-0.0086 -0.0045 -0.0027 


5/8 


0.0001 


-0.0017 -0.0009 -0.0006 


(5/32 





-0.0004 -0.0002 -0.0002 




-0.0003 


-0.0002 -0.0002 -0.0013 



Table II: Difference between the averages of the exact and the approximate QML estimators for the equation 
(]25p . h = 5, for the conventional; h = (5/2,5/8,(5/32, for the order-1 on (t)l t ; and h = •, for the adaptive 
order-1 on (r) T . 

For the data of (|25|) with largest sampling period 8 — 1, the order-1 QML estimators (5)1^,5^^) 
and (5.,5,t, &-,S,t) on uniform (r)^ T = {r„ = to + nh : n = 0, ..,T/h} D {t}r/s.T and adaptive (r). T D 
{t}T/s,T time discretizations, respectively, were computed with h = 8/2,8/8,8/32 and tolerances rtol y = 



rtolp - 
errors 



5 x 10 6 and atoL 



5 x 10 , atoh 



5 x 10 
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For each data Z\ T , with i 



1, .., 100, the 



£i{a,h,8,T) 



*8.T ~ a h.8.T 



and £j(cr, h, 8, T) 



T S,T a h,S.T 



between the exact (af T , af T ) and the approximate (S^ s T , 



a\ s T ) QML estimators were computed. Av- 



erage and standard deviation of these 100 errors were calculated for each set of values h, 8, T specified 
above, which are summarized in Table I. Note as, for fixed T, the average of the errors decreases as h 
does it. This clearly illustrates the convergence of the order-1 QML estimators to the exact one stated 
in Theorem [2] when h goes to zero. In addition, Figure 3 shows the histograms and the confidence 
limits for the order-1 QML estimators (cE^ s T'&h S t) an d (S.,a,T> &-,s,t) for each set of values h,8,T. 
By comparing the results of this figure with the corresponding in the previous ones, the decreasing dif- 
ference between the order-1 QML estimators (2^ $ T , s T ) and the exact one (cif T ,af T ) is observed 
as h decreases, which is consistent with the convergence results of Table I. Similarly, for T = 10, 20, 
the difference between the order-1 QML estimators (2^ ^ T , a\ s T ) and the adaptive QML estimators 
(2.,<s,t, &-,$,t) decreases when h does it, due to the negligible difference between the adaptive QML es- 
timators (a.j,T,&-,s,T) with the exact ones (af T ,af T ). This illustrates the usefulness of the adaptive 
strategy for improving the QML parameter estimation for finite samples with large sampling periods. 
These findings are summarized in Table II, which shows the difference between the averages of the exact 
and the approximate QML estimators. Observe the lightly higher difference between the averages of the 
exact and the adaptive estimators for both parameters when T = 30. The reason is that, for % > 21, 
the variance of the diffusion (|25|) becomes almost indistinguishable of zero. This is so small that the 
roundoff errors becomes significant in such a way that the adaptive strategy under estimates the integra- 
tion errors and so over estimates the length of the step sizes. This can be seem in Figure 4, which shows 
the average of accepted and fail steps of the adaptive QML estimators at each tk € {t}r/s,T- Note that, 
for tk > 21, the number of accepted steps is lower that 8. Therefore, the adaptive estimator can not be 
so good as those on uniform time discretization with h = 1/8, 1/32 in Table II. Contrary, for tk < 21, the 
number of accepted steps is larger that 32, and the adaptive estimator performs better than all the others. 
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5 = 0.1 


a 


a 


P 


h 


T = 10 T = 20 T = 30 


T = 10 T = 20 T = 30 


T = 10 T = 20 T = 30 


5 


0.00048 0.00045 0.00043 


-0.0405 -0.0397 -0.0396 


-0.00015 4.4 x 10 _b 3.9 x 10 _b 


5/2 


0.00014 0.00013 0.00011 


-0.0110 -0.0100 -0.0100 


-0.00009 0.9 x 10" b 1.4 x 10 _b 


5/4 


0.00003 0.00004 0.00003 


-0.0025 -0.0023 -0.0022 


-0.00002 0.8 x 10 _b 1.0 x 10 _b 


5/8 


0.00001 


-0.0004 -0.0004 -0.0004 


0.5 x 10 _b 0.6xl0 _b 




-0.00008 0.00004 -0.00006 


0.0049 0.0010 0.0092 


0.00011 0.4 x 10 _b 



Table III: Difference between the averages of the exact and the approximate QML estimators for the equation 
(|26[l , h — 8, for the conventional; h = 6/2,5/4,8/8, for the order-1 on (t)l t ; and h = •, for the adaptive 
order-1 on (r). T . 



5 = 0.1 


/i = 6 


h = 5/2 


h = 5/4 


h = 5/8 


T = 10 
a T = 20 
T = 30 


6.6 ±4.9 x 10~ 4 


1.6 ± 1.2 x 10~ 4 


3.9 ±2.9 x 10~ b 


9.9 ±8.4 x 10" b 


6.8 ±4.7 x 10~ 4 


1.4 ±0.9 x 10~ 4 


3.4 ±2.1 x 10~ b 


9.3 ±5.8 x 10" b 


6.7 ±4.6 x 10~ 4 


1.4 ±0.9 x 10~ 4 


3.3 ±2.1 x 10~ b 


8.2 ±5.9 x 10- B 


T= 10 
o- T = 20 
T = 30 


5.9 ±4.4 x 10~ 2 


1.3 ±0.8 x 10~ 2 


2.6 ± 1.8 x 10- 3 


5.1 ±5.1 x 10" 4 


6.0 ±4.2 x 10~ 2 


1.2 ±0.8 x 10~ 2 


2.4 ± 1.6 x 10- y 


4.9 ±4.0 x 10~ 4 


5.9 ±4.1 x 10~ 2 


1.1 ± 0.7 x 10~ 2 


2.4 ± 1.6 x lO" 3 


4.4 ±3.8 x 10~ 4 


T= 10 
p T = 20 
T= 30 


1.1 ± 1.3 x 10~ 3 


2.9 ±4.0 x 10~ 4 


6.2 ±8.3 x 10~ 5 


1.5 ± 1.9 x 10~ 5 


1.6 ± 1.5 x 10~ 4 


5.1 ±4.6 x 10~ b 


1.5 ± 0.8 x 10~ b 


6.2 ± 2.4 x 10- B 


9.0 ±7.4 x 10~ 5 


3.1 ± 2.2 x 10~ 5 


1.1 ±0.4 x 10" 5 


5.8 ± 1.9 x 10~ y 



Table IV: Confidence limits for the error between the exact and the approximate QML estimators of the 
equation (126[l , h — 5, for the conventional; and h = 5/2, 5/4, 5/8, for the order-1 on (t)^ t . 

Figure 5 shows the histograms and the confidence limits for both, the exact (af T ) and the conventional 
{®8,t) QML estimators of a computed from the twelve sets of 100 time series Z\ T available for the example 
2. Figure 6 shows the same but, for the exact (afj>) and the conventional (&5,t) QML estimators of 
a, whereas Figure 7 does it for the estimators 'pf? and 'pg T of p. Note that, for this example, the 
diffusion parameters a and p can not be estimated from the samples Z\ T with the largest sampling 
period 8 = 1. From the other data with sampling period 5 < 1, the tree parameters can be estimated 
and, the bias of the exact and the conventional QML estimators is not so large as in the previous example. 
Nevertheless, in this extreme situation of low information in the data, the order-1 QML estimators is able 
to improve the accuracy of the parameter estimation when h decreases. This is shown in Figure 8 for the 
samples Z\ T with 5 = 0.1 and T = 10, 20, 30, and summarized in Table III. The order-1 QML estimators 

(®h,6,Ti&h,5,TiPh,5,T) an d (&;S,t>V;6,t,P.,s,t) are again computed on uniform (t)^ t D {t} T / S , T and 
adaptive (t). t D {£}t/<5,t time discretizations, respectively, with T = 10,20,30, h = 5/2,5/4,5/8 and 
tolerances rtol y = rtol-p = 10 -6 and atol y = 10 -9 , atolp = 10~ 12 . The average of accepted and fail steps 
of the adaptive QML estimators at each tk & Wt/<5.t are shown in Figure 4. Note that, the average of 
the accepted steps of the adaptive algorithm is not bigger than 8 for each tk € {£}t/(5,t- Therefore, as 
it is shown in Table III, the average of the adaptive estimator is not so good than that of some other 
estimators on uniform time discretizations (r)t T with h > 5/8. For this example, the convergence of the 
order-1 QML estimators to the exact one is shown in Table IV, which gives the confidence limits for the 
error between theses estimators for different values of h. 

5.3 Simulations with two-dimensional equations 

For the examples 3 and 4, 100 realizations of the equation were similarly computed by means of the Local 
Linearization and the Euler scheme, respectively. For each example, the realizations where computed over 
the thin time partition {to + 10 _4 n : n = 0, .., 30 X 10 4 } for guarantee a precise simulation of the stochas- 
tic solutions on the time interval [to, to + 30]. Two subsamples of each realization at the time instants 
{t}M,T = {tk =to + kT/M : k = 0, .., M — 1} were taken as observation Z of x for making inference with 
T = 30 and two values of M. In particular, M = 30, 300 were used, which correspond to the sampling 
periods 8=1, 0.1. In this way, two sets of 100 time series Z\ T = {z l k : k = 0, .., M — 1, M = T/8}, with 
i = 1, .., 100, of M observations z\ each one were available for each example with the two values of {8, T) 
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mentioned above. 



T = 30 


a 


a 


h 


5=1 5 = 0.1 


5=1 5 = 0.1 


5 


-0.2500 -0.1428 


-0.7328 -0.0052 


5/16 


-0.0965 -0.0044 


-0.0893 0.0012 


5/64 


-0.0333 0.0029 


-0.0757 0.0013 




-0.0096 0.0068 


-0.0739 0.0013 



Table V: Bias of the approximate QML estimators for the equation (|27|l - (|28| l. h = 5, for the conventional; 
h = 5/16, (5/64, for the order-1 on (t)l t ; and h = ■, for the adaptive order-1 on (r). T . 



T = 30 


a 


a 


h 


5=1 5 = 0.1 


5=1 5 = 0.1 


5 


-0.8000 -0.2507 


-0.8000 -0.3748 


5/8 


-0.4234 -0.0481 


-0.2451 0.0005 


(5/32 


-0.2210 -0.0219 


-0.1910 0.0015 




-0.1611 -0.0046 


-0.1898 0.0001 



Table VI: Bias of the approximate QML estimators for the equation (|29[) - (I30|) . h = 5, for the conventional; 
h = (5/8, (5/32, for the order-1 on (t)^ t ; and h — •, for the adaptive order-1 on (t) t . 

For both examples, the order-1 QML estimators (a^ s T , a\ s T ) and {a.^s,T, &-,s,t) on uniform (r) h T D 
{1}t/s,t an d adaptive (r). T D {t}r/s.T time discretizations, respectively, were computed from the two 
sets of 100 data Z\ T with T = 30 and 5 = 1,0.1. The values of h were set as h = 5,5/16,5/64 for 
the example 3, and as h = 5,5/8,5/32 for the example 4. The tolerances for the adaptive estimators 
were set as in the first example. Figures 9 and 11 show the histograms and the confidence limits for 
the estimators {ah ST , ST ) and (oi.,8,t-,g-,8,t) corresponding to each example. For the two examples, 
the difference between the order-1 QML estimator (a h sr^hS t) an( i the adaptive one ((£.,6,T)<7-,S,t) 
decreases when h does it. This is, according Theorem [2j an expected result by assuming that the 
difference between the adaptive and the exact QLM estimators is negligible for (r) T thin enough. In 
addition, Table V and VI show the bias of the approximate QML estimators for these examples. Observe 
as the adaptive {a..$.T, &-,s,t) and the order-1 QML estimator (3^ ^ T , a\ ^ T ) with h < 5 provide much 
less biased estimation of the parameters (a, a) than the conventional QML estimator (Sj^,^^), 
which is in fact unable to identify the parameters of the examples. Clearly, this illustrates the usefulness 
of the order-1 QML estimator and its adaptive implementation. However, as it is shown in Table V 
for 5 = 0.1, no always the adaptive estimator {ci. s,Ti&;8,t) is l ess unbiased than the order-1 QML 
estimator (2^ s T , a\ s T ) for some h < 5. This can happen for one of following reasons: 1) the bias of the 
exact QML estimator when the adaptive estimator is close enough to it, or 2) an insufficient number of 
accepted steps of the adaptive estimator for a given tolerance. In our case, since (S^ s T , <7 ; " ,5 T ) converges 
to ((X.^t, &-,s,t) as h decreases (Figure 9 with 5 = 0.1) and the average of accepted steps of the adaptive 
estimators is acceptable (Figure 10 with 5 = 0.1), the first explanation is more suitable. Figures 10 and 
12 show the average of accepted and fail steps of the adaptive QML estimators at each tk £ {£}t/(5,t 
for each example. Note how the average of accepted steps corresponding to the estimators from samples 
with 5 = 0.1 is ten time lower than that of the estimators from samples with 5 = 1, which is an expected 
result as well. 

6 Conclusions 

A modification of the conventional approximations to the quasi-maximum likelihood (QML) method was 
introduced for the parameter estimation of diffusion processes given a time series of complete observations. 
This is based on a recursive approximation to the first two conditional moments of the diffusion process 
through discrete-time schemes. For all given data, the convergence of the modified QML estimators to the 
exact one was proved when the error between the discrete-time approximation and the diffusion process 
decreases. It was also demonstrated that, for an increasing number of observations, they are asymptoti- 
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cally normal distributed and their bias decreases when the above mentioned error does it. As particular 
instance, the order-/? QML estimators based on Local Linearization schemes were proposed. For them, 
practical algorithms were also provided and their performance in simulation illustrated with various ex- 
amples. Simulations shown that: 1) with thin time discretizations between observations, the order-1 
QML estimator provides satisfactory approximations to the exact QML estimator; 2) the convergence 
of the order-1 QML estimator to the exact one when the maximum stepsize of the time discretization 
between observations decreases; 3) with respect to the conventional QML estimator, the order-1 QML 
estimator gives much better approximation to the exact QML estimator, and has less bias and higher 
efficiency; 4) with an adequate tolerance, the adaptive order-1 QML estimator gives an automatic, suit- 
able and computational efficient approximation to the exact QML estimator; and 5) the effectiveness of 
the order-1 QML estimator for the identification of SDEs from a reduced number of complete observations. 
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Figure 1: Histograms and confidence limits for the exact (cif T ) and the conventional (5j,t) QML es- 
timators of a computed from the Example 1 data with sampling period 5 and time interval of length 
T. 
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Figure 2: Histograms and confidence limits for the exact (a s T ) and the conventional ($s,t) QML es- 
timators of a computed from the Example 1 data with sampling period S and time interval of length 
T. 
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Figure 3: Histograms and confidence limits for the oder-1 QML estimators of a and er computed on 
uniform (t)JJ t and adaptive (r). T time discretizations from the Example 1 data with sampling period 
6 = 1 and time interval of length T. 
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Figure 4: Average (*) and 90% confidence limits (-) of accepted and failed steps of the adaptive QML 
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Figure 5: Histograms and confidence limits for the exact (cif T ) and the conventional (5j,t) QML es- 
timators of a computed from the Example 2 data with sampling period 6 and time interval of length 
T. 



21 



T=10 



T=20 



T=30 




3 5 7 

4.886 + 1.5139 



30 
15 





3 5 7 

5.031 6 ±0.76283 




3 5 7 

4.9379 ±0.21 126 




3 5 7 

4.988 ± 0.063973 

T=10 




3 5 7 

4.9164 ± 1.9822 




3 5 7 

5.0721 ±0.76162 




3 5 7 

4.9385 ±0.21 123 




3 5 7 

4.9869 ± 0.06372 




3 5 7 

4.8147 ± 1.4634 



30 
15 





3 5 7 

5.0298 ± 0.75947 




3 5 7 

4.9873 ± 0.062269 

T=20 




3 5 7 

5.1 269 ±2.0084 




3 5 7 

5.0695 ± 0.75847 




3 5 7 

4.9868 ± 0.062336 




3 5 7 

4.8176 ±1.4639 




3 5 7 

5.0285 ±0.751 74 



^ <T 



4.9391 ±0.20985 



1 



3 5 7 

4.9876 ±0.062312 

T=30 




3 5 7 

5.21 19 ±1.9792 




3 5 7 

5.0681 ±0.75467 



^ ° 5 ,T 



4.9392 ± 0.20983 



30 








15 










3 5 7 

4.9879 ± 0.062881 



Figure 6: Histograms and confidence limits for the exact (afx) an d the conventional (S&,t) QML es- 
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T. 
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Figure 7: Histograms and confidence limits for the exact (p s T ) and the conventional (p s T ) QML es- 
timators of p computed from the Example 2 data with sampling period 5 and time interval of length 
T. 
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Figure 8a: Histograms and confidence limits for the oder-1 QML estimators of a and a computed on 
uniform (t)^ t and adaptive (r). T time discretizations from the Example 2 data with sampling period 
8 = 0.1 and time interval of length T. 
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Figure 8b: Histograms and confidence limits for the oder-1 QML estimators of p computed on uniform 
(r)i rp and adaptive (r) T time discretizations from the Example 2 data with sampling period S = 0.1 
and time interval of length T. 
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Figure 9: Histograms and confidence limits for the oder-1 QML estimators of a and a computed on 
uniform (r)^ T and adaptive (r). T time discretizations from the Example 3 data with sampling period 8 
and time interval of length T = 30. 
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Figure 10: Average (*) and 90% confidence limits (-) of accepted and failed steps of the adaptive QML 
estimator at each tk € {i}iv in the Example 3. 
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Figure 11: Histograms and confidence limits for the oder-1 QML estimators of a and a computed on 
uniform (r)^ T and adaptive (r). T time discretizations from the Example 4 data with sampling period 8 
and time interval of length T = 30. 
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Figure 12: Average (*) and 90% confidence limits (-) of accepted and failed steps of the adaptive QML 
estimator at each tk € {i}iv in the Example 4. 
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